The Feasibility of Scalable Video Streaming 
over Femtocell Networks 



Donglin Hu and Shiwen Mao 
Department of Electrical and Computer Engineering 

<N '• Auburn University, Auburn, AL, USA 

O 

(N 



o 

O: 
a^: 



Abstract 

In this paper, we consider femtocell CR networks, where femto base stations (FBS) are deployed to 
greatly improve network coverage and capacity. We investigate the problem of generic data multicast in 
t/3 . femtocell networks. We reformulate the resulting MINLP problem into a simpler form, and derive upper 

and lower performance bounds. Then we consider three typical connection scenarios in the femtocell 
' network, and develop optimal and near-optimal algorithms for the three scenarios. Second, we tackle 

, the problem of streaming scalable videos in femtocell CR networks. A framework is developed to 

m 

. captures the key design issues and trade-offs with a stochastic programming problem formulation. In 
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the case of a single FBS, we develop an optimum-achieving distributed algorithm, which is shown also 
■ optimal for the case of multiple non-interfering FBS's. In the case of interfering FBS's, we develop a 

greedy algorithm that can compute near-opitmal solutions, and prove a closed-form lower bound on its 
performance. 
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I. Introduction 

Due to the use of open space as transmission medium, capacity of wireless networks are usually 
limited by interference. When a mobile user moves away from the base station, a considerably 
larger transmit power is needed to overcome attenuation, while causing interference to other 
users and deteriorating network capacity. To this end, femtocells provide an effective solution 
that brings network infrastructure closer to mobile users. A femtocell is a small (e.g., residential) 
cellular network, with a femto base station (FBS) connected to the owner's broadband wireline 
network [[Il-[l3l. The FBS serves approved users when they are within the coverage. Among the 
many benefits, femtocells are shown effective on improving network coverage and capacity [1]. 
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Due to reduced distance, transmit power can be greatly reduced, leading to prolonged battery life, 
improved signal-to-interference-plus-noise ratio (SINR), and better spatial reuse of spectrum. 

Femtocells have received significant interest from the wireless industry. Although highly 
promising, many important problems should be addressed to fully harvest their potential, such 
as interference mitigation, resource allocation, synchronization, and QoS provisioning [[T]|, 
It is also critical for the success of this technology to support important applications such as 
real-time video streaming in femtocell networks. 

In this paper, we first investigate the problem of data multicast in femtocell networks. It is not 
atypical that many users may request for the same content, as often observed in wireline net- 
works. By allowing multiple users to share the same downlink multicast transmission, significant 
spectrum and power savings can be achieved. 

In particular, we adopt superposition coding (SC) and successive interference cancellation 
(SIC), two well-known PHY techniques, for data multicast in femtocell networks JH. With SC, 
a compound signal is transmitted, consisting of multiple signals (or, layers) from different senders 
or from the same sender. With SIC, a strong signal can be first decoded, by treating all other 
signals as noise. Then the decoder will reconstruct the signal from the decoded bits, and subtract 
the reconstructed signal from the compound signal. The next signal will be decoded from the 
residual, by treating the remaining signals as noise. And so forth. A special strength of the SC 
with SIC approach is that it enables simultaneous unicast transmissions (e.g., many-to-one or 
one-to-many). It has been shown that SC with SIC is more efficient than PHY techniques with 
orthogonal channels ffl, [[5l. 

We adopt SC and SIC for the unique femtocell network environment, and investigate how 
to enable efficient data multicast from the femtocells to multiple users. We formulate a Mixed 
Integer Nonlinear Programming (MINLP) problem, which is NP-hard in general. The objective 
is to minimize the total BS power consumption. Then we reformulate the MINLP problem into a 
simpler form, and derive upper and lower performance bounds. We also derive a simple heuristic 
scheme that assigns users to the BS's with a greedy approach. Finally, we consider three typical 
connection scenarios in the femtocell network, and develop optimal and near-optimal algorithms 
for the three scenarios. The proposed algorithms have low computational complexity, and are 
shown to outperform the heuristic scheme with considerable gains. 

Then, we investigate the problem of video streaming in femtocell cognitive radio (CR) net- 
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works. We consider a femtocell network consisting of a macro base station (MBS) and multiple 
FBS's. The femtocell network is co-located with a primary network with multiple licensed 
channels. This is a challenging problem due to the stringent QoS requirements of real-time videos 
and, on the other hand, the new dimensions of network dynamics (i.e., channel availability) and 
uncertainties (i.e., spectrum sensing and errors) found in CR networks. 

We adopt Scalable Video Coding (SVC) in our system Q, 0. SVC encodes a video into 
multiple substreams, subsets of which can be decoded to provide different quality levels for the 
reconstructed video [8J. Such scalability is very useful for video streaming systems, especially 
in CR networks, to accommodate heterogeneous channel availabilities and dynamic network 
conditions. We consider H.264/SVC medium grain scalable (MGS) videos, since MGS can 
achieve better rate-distortion performance over Fine-Granularity-Scalability (FGS), although it 
only has Network Abstraction Layer (NAL) unit-based granularity dHl. 

The unique femtocell network architecture and the scalable video allow us to develop a 
framework that captures the key design issues and trade-offs, and to formulate a stochastic 
programming problem. It has been shown that the deployment of femtocells has a significant 
impact on the network performance [IJ. In this paper, we examine three deployment scenarios. 
In the case of a single FBS, we apply dual decomposition to develop a distributed algorithm that 
can compute the optimal solution. In the case of multiple non-interfering FBS's, we show that 
the same distributed algorithm can be used to compute optimal solutions. In the case of multiple 
interfering FBS's, we develop a greedy algorithm that can compute near-optimal solutions, and 
prove a closed-form lower bound for its performance based on an interference graph model. 
The proposed algorithms are evaluated with simulations, and are shown to outperform three 
alternative schemes with considerable gains. 

The remainder of this paper is organized as follows. The related work is discussed in Section |Ill 
We investigate the problem of data multicast over fenmtocell networks in Section |nil The 
problem of streaming multiple MGS videos in a femtocell CR network is discussed in Section HVl 
Section |V] concludes this paper. 

II. Background and Related Work 

Femtocells have attracted considerable interest from both industry and academia. Technical 
and business challenges, requirements and some preliminary solutions to femtocell networks are 
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discussed in [[T]|. Since FBS's are distributedly located and are able to spatially reuse the same 
channel, considerable research efforts were made on interference analysis and mitigation [9], 
IfTOl . A distributed utility based SINR adaptation scheme was presented in |l9l to alleviate cross- 
tire interference at the macrocell from co-channel femtocells. Lee, Oh and Lee [UOll proposed a 
fractional frequency reuse scheme to mitigate inter-femtocell interference. 

Deploying femtocells by underlaying the macrocell has been proved to significantly improve 
indoor coverage and system capacity. However, interference mitigation in a two-tier heteroge- 
neous network is a challenging problem. In [ITTIl . the interference from macrocell and femtocells 
was mitigated by a spatial channel separation scheme with codeword-to-channel mapping. In [12J, 
the rate distribution in the macrocell was improved by subband partitioning and modest gains 
were achieved by interference cancellation. In [|13ll . the interference was controlled by denying 
the access of femtocell base stations to protect the transmission of nearby macro base station. 
A novel algorithmic framework was presented in [14] for dynamic interference management 
to deliver QoS, fairness and high system efficiency in LTE-A femtocell networks. Requiring 
no modification of existing macrocells, CR was shown to achieve considerable performance 
improvement when applied to interference mitigation fT5\. In |fT6l, the orthogonal time-frequency 
blocks and transmission opportunities were allocated based on a safe/victim classification. 

SIC has high potential of sending or receiving multiple signals concurrently, which improves 
the transmission efficiency ifTTl . In [[5]|, the authors developed MAC and routing protocols that 
exploit SC and SIC to enable simultaneous unicast transmissions. Sen, et al. investigated the 
possible throughput gains with SIC from a MAC layer perspective [18J. Power control for SIC 
was comprehensively investigated and widely applied to code division multiple access (CDMA) 
systems |fT9l - ll23l . Applying game theory, Jean and Jabbari proposed an uplink power control 
under SIC in direct sequence-CDMA networks [19|. In [20J, the authors introduced an iterative 
two-stage SIC detection scheme for a multicode MIMO system and showed the proposed scheme 
significantly outperformed the equal power allocation scheme. A scheme on joint power control 
and receiver optimization of CDMA transceivers was presented in [[2T]|. In [|22ll . Il23l . the impact 
of imperfect channel estimation and imperfect interference cancellation on the capacity of CDMA 
systems was examined. 

The problem of video over CR networks has only been studied in a few recent papers ll24l - 
ll27l . In our prior work, we investigated the problem of scalable video over infrastructure-based 
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CR networks [6] and multi-hop CR networks |I71. The preliminary results of video over femtocell 
CR networks were presented in ll28l . 

III. Multicast in Femtocell Networks with Superposition Coding and 
Successive Interference Cancellation 

In this section, we formulate a Mixed Integer Nonlinear Programming (MINLP) problem of 
data multicast in femotcell networks, which is NP-hard in general. Then we reformulate the 
MINLP problem into a simpler form, and derive upper and lower performance bounds. We 
also derive a simple heuristic scheme that assigns users to the BS's with a greedy approach. 
Finally, we consider three typical connection scenarios in the femtocell network, and develop 
optimal and near-optimal algorithms for the three scenarios. The proposed algorithms have low 
computational complexity, and are shown to outperform the heuristic scheme with considerable 
gains. 

A. System Model and Problem Statement 

1) System Model: Consider a femtocell network with an MBS (indexed 0) and M FBS's 
(indexed from 1 to M) deployed in the area. The M FBS's are connected to the MBS and the 
Internet via broadband wireline connections. Furthermore, we assume a spectrum band that is 
divided into two parts, one is allocated to the MBS with bandwidth Bq and the other is allocated 
to the M FBS's. The bandwidth allocated to FBS m is denoted by Bm- When there is no overlap 
between the coverages of two FBS's, they can spatially reuse the same spectrum. Otherwise, 
the MBS allocates disjoint spectrum to the FBS's with overlapping coverages. We assumed the 
spectrum allocation is known a priori. 

There are K mobile users in the femtocell network. Each user is equipped with one transceiver 
that can be tuned to one of the two available channels, i.e., connecting to a nearby FBS or to 
the MBS. The network is time slotted. We assume block-fading channels, where the channel 
condition is constant in each time slot [4l|. We focus on a multicast scenario, where the MBS 
and FBS's multicast a data file to the K users. The data file is divided into multiple packets 
with equal length and transmitted in sequence with the same modulation scheme. Once packet / 
is successfully received and decoded at the user, it requests packet (/ + 1) in the next time slot. 
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Fig. 1. Superposition coding and successive interference cancellation. 

We adopt SC and SIC to transmit these packets JH, as illustrated in Fig. [T] In each time 
slot t, the compound signal has L layers (or, levels), denoted as Di{t), ■ ■ ■ , DL{t). Each level 
i = 1, ■ ■ ■ , L, is a packet requested by some of the users in time slot t. A user that has 
successfully decoded Di(t), for alH = 1, ■ ■ ■ , / — 1, is able to subtract these signals from the 
received compound signal and then decodes Di(t), while the signals from Di^i{t) to Di^t) are 
treated as noise. 

2) Problem Statement: For the SC and SIC scheme to work, the transmit powers for the levels 
should be carefully determined, such that there is a sufficiently high SNR for the levels to be 
decodable. It is also important to control the transmit powers of the BS's to reduce interference 
and leverage frequency reuse. The annual power bill is a large part of a mobile operator's 
costs |[29l . Minimizing BS power consumption is important to reduce not only the operator's 
OPEX, but also the global CO2 emission; an important step towards "green" communications. 

Therefore, we focus on BS power allocation in this paper. The objective is to minimize the 
total power of all the BS's, while guaranteeing a target rate Rtar for each user. Recall that the 
data file is partitioned into equal-length packets. The target rate Rtar ensures that a packet can 
be transmitted within a time slot, for given modulation and channel coding schemes. 

Define binary indicator J,^, for all m and k, as: 

, 1, if user k connects to BS m 

It={ (1) 
I 0, otherwise. 

Consider a general time slot t when L data packets, or levels, are requested. We formulate the 
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optimal power allocation problem (termed OPT-Power) as follows. 

M L 

minimize: >^ >^ P,™ 




(2) 



m=0 1=1 



subject to: 5„log2(l + it^t) > RtariL for all k 



(3) 



M 



Vj^ = 1 



for all k 



(4) 



m=0 



P;™ > 0, for all /, m, 



(5) 



where P^*" is the power of BS m for transmitting the level / packet; 7^ is the SNR at user k 
if it connects to BS m. Constraint (|3]) guarantees the minimum rate at each user. Constraint Q 
is due to the fact that each user is equipped with one transceiver, so it can only connect to one 



Let Ui denote the set of users requesting the level / packet. A user k eUi has decoded all the 
packets up to Di_i. It subtracts the decoded signals from the received signal and treats signals 
Di^i, ■ ■ ■ , Dl as noise. The SNR at user k E Ui, for / = 1, ■ ■ ■ , L — 1, can be written as: 



\ i=i+i / 

where iJ^ is the random channel gain from BS m to user k and Nq is the noise power. For user 
k eUl that requests the last packet D^, the SNR is 



The optimization variables in Problem OPT-Power consist of the binary variables J^'s and 
the continuous variables P™'s. It is an MINLP problem, which is NP-hard in general. In 
Section IIII-Bl we first reformulate the problem to a obtain a simpler form, and then develop 
effective algorithms for optimal and suboptimal solutions. 

B. Reformulation and Power Allocation 

In this section, we reformulate Problem OPT-Power to obtain a simpler form, and derive 
an upper bound and a lower bound for the total BS power. The reformulation also leads to a 
simple heuristic algorithm. Finally, we introduce power allocation algorithms for three connection 
scenarios. 



BS. 




(6) 




(7) 
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1 ) Problem Reformulation: Due to the monotonic logarithm functions and the binary indica- 
tors J^, constraint Q can be rewritten as: 

'y^ >T^ m = Q I ■■■ M (8) 

where = Tm =■ 2^*'"^/^™ — 1 is the minimum SNR requirement at user k that connects to 
BS m. To further simplify the problem, define = J2i=i with Q^+i ~ Then power 
is the difference 

Pr = QT - QT+i- (9) 

Problem OPT-Power can be reformulated as: 

M 

minimize (10) 

m=0 

rfc ^ rfc \ "P 



ml 



subject to: HI{QT -QT+i)l [N.+H^QT^,) > T^K 

for all k eUi, I = 1, - ■■ ,L (11) 

Qr>Qm,^ = l,---,^ (12) 

M 

= 1, for all k. (13) 

m=0 

For / < L, constraint (fTTI) can be rewritten as: 

gr/^ > [NoTjH'^, + (1 + F^)QI!ti] (14) 

Let be the subset of users connecting to BS m in W^. Since > (fT4l) can be rewritten 

as, 

= max |q™i, max [NoTJH'^ + (1 + F„)Q™i] } • (15) 
From ([B]), we define a function Qf^ = Fra{QT+iMr) as: 

FmiQ^Mj ) = < f ^ (16) 

^ \ ra^^^eur + (1 + r™)Qr+i} , ^ 0. 

Obviously, FmiQ^liMD is non-decreasing with respect to QYli- It follows that 

QT = FmiQTMT') = Fm{Fm{QT MT) MT) 

= Fm{- ■ ■ (Fm(Q™+i,W™),W™_i), ■ ■ ■ ,W{") 

= Fm{---iFUOML)ML-i),--- MD- (17) 
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If none of the subsets (/ = 1, ■ ■ ■ , L) is empty, we can expand the above recursive term 
using (fT6l) . It follows that 

L 

QT = iVor„ + r^)^r max , (18) 

1=1 ' 

where the exponent cj" is defined as c^* = and = + Otherwise, if a subset Ul^ = for 
some m, we have that Qr = Q7+i^ maxkeup {^/H^} = maxjtgo {l/if.^} = 0, and c[" = c]^^. 
Eq. ([I8]) still holds true. 

Finally, the objective function (flOl) can be rewritten as 

M L 

NoTm 5^(1 + r„)^™ max {l/i/^} . (19) 

m=0 Z=l ' 

Since (1 + F^) > 0, it can be seen that to minimize the total BS power, we need to keep the 
c™'s as low as possible. 

2 ) Performance Bounds : The reformulation and simplification allow us to derive performance 
bounds for the total BS power consumption. First, we derive the upper bound for the objective 
function (flOl) . Define a variable 

Gm= max max {F^/if^j , (20) 

1(^{1,--- ,L}k£UJ^ ' 

which corresponds to the user with the worst channel condition among all users that connect to 
BS m. It follows that: 

M ML 

E = A^o E + Trnr max {rjHt} 

m=0 m=0 1=1 ' 

M L 

<iVo5^5^(l + F^)^rG„ 

m=0 1=1 
M L 

<iVo5ZG„ J](l + F^)'-i 

m=0 1=1 
M 

= NoY,Gm [(1 + F„)^ - 1] /F„. (21) 

m=0 

In (|2TI) . the first inequality is from the definition of Gm- The second inequality is from the 
definition of c[^^. Specifically, c™ = 0; when ^ 0, we have c™ = c™ + 1; when W/" = 0, 
we have = c™]^. It follows that < / — I. Therefore, (|2TI) is an upper bound on the objective 
function (flOl) . 
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Furthermore, by defining G = max^gjo,... ,m} {Gm}, and T = max^Gio, - ,m} {rm}, we can 
get a looser upper bound from [2l] as 



M 

J2qT< NoG{m + 1) [(1 + r)^ - 1] /r. (22) 

m=0 

Next, we derive a lower bound for (flOl). Define 



r = minme{o,- -,M} {^m} ■ 



(23) 



We have that 



M M 

m=0 i=l 
M L 

m=0 1=1 
L M 

1=1 m=0 
L 

>iVo(M + i)E^'(i + r)^ 



m=0 m=0 ;=1 



Af+1 
1=1 

L 



> A/'o(M + l)E^'(l + r)i^T+i. (24) 

1=1 

In (l24l) . the first inequality is from the definition of G'. The second inequality is due to the 
definition of £. The third inequality is due to the fact that (1 + F)'^" is a convex function. The 
fourth inequality is because that each level must be transmitted by at least one BS. Thus for 
each level /, there is at least one c™ = dj^-^ + 1 for some m. It follows that the sum Y1!^=q 
should be greater than / — 1. Therefore, (|24l) provides a lower bound for (flOl) . 

Furthermore, by defining G_ = minigji^... {G^}, we can obtain a looser lower bound from 
(I24l) as 

yQT> NoG{M + l y , . (25) 

^0 (i + r)— -1 

3) A Simple Heuristic Scheme : We first describe a greedy heuristic algorithm that solves 
OPT- Power with suboptimal solutions. With this heuristic, each user compares the channel gains 
from the MBS and the FBS's. It chooses the BS with the best channel condition to connect to, 
thus the values of the binary variables are determined. Once the binary variables are fixed, 

10 



all the subsets UJ^'s are determined. Starting with Q^_^_l = 0, we can apply (fT5l) iteratively to 
find the QJ^'s. Finally, the transmit powers can be computed using (|9l). 

With this approach, among the users requesting the level / packet, it is more likely that some of 
them connect to the MBS and the rest connect to some FBS's, due to the random channel gains 
in each time slot. In this situation, both MBS and FBS will have to transmit all the requested 
data packets. Such situation is not optimal for minimizing the total power, as will be discussed 
in Section IIII-B4b[ 

4) Power Allocation Algorithms : In the following, we develop three power allocation algo- 
rithms for three different connection scenarios with a more structured approach. 

a) Case I-One Base Station: We first consider the simplest connection scenario where all 
the K users connect to the same BS (i.e., either the MBS or an FBS). Assume all the users 
connect to BS m. Then we have = 1 for all k, and all the subsets UJ^ axe non-empty; = 
for all k and all m' ^ m, and all the subsets Up' are empty for m' ^ m. 

From (fT6l) . we can derive the optimal solution as: 

Qr = (i + r^)gri + max{iVoWi7^}, 

L 

= iVor„V(l + r^)^-'max{l/i7^}, / = 1,2,---,L. (26) 

i=l ' 

Recall that Q^^i = Ql^i = 0, the optimal power allocation for Problem OPT-Power in this 
case is: 

f Qy* - Q?l\ , m' = m, for all / 

pni* ^ } ^i+l' ' (27) 

I 0, m! ^ m, for all /. 

b) Case II-MBS and One FBS : We next consider the case with one MBS and one FBS 
(i.e., M = 1), where each user has two choices: connecting to either the FBS or the MBS. 

Recall that and Ul are the subset of users who connected to the MBS and the FBS, 
respectively, and who request the level / packet. Examining (fTSi) . we find that the total power of 
BS m can be significantly reduced if one or more levels are not transmitted, since the exponent 
c™ will not be increased in this case. Furthermore, consider the two choices: (i) not transmitting 
level /, and (ii) not transmitting level /' > / from BS m. The first choice will yield larger power 
savings, since more exponents (i.e., 0^,0^^, ■■■ ,c™_i) will assume smaller values. Therefore, 
we should let these two subsets be empty whenever possible, i.e., either W° = or = 0. 
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TABLE I 

Power Allocation Algorithm For Case II 



1: 
2: 


Initialize all cf , cj, Q%^i and Q^+i 

FOR i = no L 


to zero; 


3: 


IF (ro(i + ro)'=°G? < ri(i + ri) 




4: 


Set U° = Wi and Ul = 0; 




5: 
6: 


c? = c? + 1; 
ELSE 




7: 
8: 
9: 


Set Ui = and Ui = 6/^; 

1 111 
cf = + 1; 

END IF 




10: 


END FOR 




11: 


FOR ; = i TO 1 




12: 


Q" = J'o(Q?+i,Wf) and P0 = Q» 




13: 


Qi =-Fi(Qi+i,Wi')andP,i=0,i 


~ Q}+i' 


14: 


END FOR 





According to this policy, all the users requesting the level / packet will connect to the same BS. 
We only need to make the optimal connection decision for each subset of users requesting the 
same level of packet, rather than for each individual user. 

Since not transmitting a lower level packet yields more power savings for a BS, we calculate 
the power from the lowest to the highest level, and decide whether connecting to the MBS or 
the FBS for users in each level. Define G° = max^g^^ |l/ifg} and Gj = max^^u^ 
The algorithm for solving Problem OPT-Power in this case is given in Table HI In Steps 2-10, 
the decision on whether connecting to the MBS or the FBS is made by comparing the expected 
increments in the total power. The user subsets W° and are determined in Steps 4 and 7. In 
Steps 11-14, Q'P's and the corresponding P™'s are computed in the reverse order, based on the 
determined subsets W° and W/. 

The computational complexity of this algorithm is 0{L). 

c) Case III-MBS and Multiple FBS's : Finally, we consider the general case with one MBS 
and multiple FBS's in the network. Each user is able to connect to the MBS or a nearby FBS. 
Recall that we define Ui as the set of users requesting the level / packet, and UJ^ as the subset 
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of users in Ui that connect to BS m. These sets have the following properties. 

U]^f]Up' = 0, for all m' ^ m. 
The first property is due to the fact that each user must connect to the MBS or an FBS. The 
second property is because each user can connect to only one BS. The user subsets connecting 
to different BS's do not overlap. Therefore, W™'s is a partition of Ui with respect to m. 

In addition, we define 5™ as the set of possible users that are covered by BS m and request 
the level / packet. These sets have the following properties. 

< SJ" n = for all m 7^ 

5™ = 0, for all m' ^ m and m, m' ^ 0. 

The first property is because all users in each femtocell are covered by the MBS. The second 
property indicates that the users covered by FBS m are a subset of the users covered by the 
MBS. The third property shows that the user subsets in different femtocells do not overlap. We 
can see that the SJ^'s, for m = 1, ■ ■ ■ , M, are also a partition of Ui. 

Define Wm{V() = laaxkeu {V-^m}' where U is the set of users and m = 0, ■ ■ ■ , M. If the 
set U is empty, we define Wmi^) = 0. For example, consider Case II where M = 1. We have 
Sf = Si = Uu WoiMi) = Gl and Wiipii) = G\. 

The power allocation algorithm for Case III is presented in Table |IIl The algorithm iteratively 
picks users from the eligible subset and assigns them to the allocated subset W/". In each 
step /, \1/ is the subset of FBS's that will transmit the level / packet; the complementary set \1/ 
is the subset of FBS's that will not transmit the level / packet. The expected increment in total 
power for each partition is computed, and the partition with the smallest expected increment will 
be chosen. A™ is the power of BS m for transmitting the level / data packet. In Steps 6-15, the 
MBS and FBS combination ^! is determined for transmitting the level / packet, with the lowest 
power Aq. In Steps 16-30, elements in 5™ are assigned to according to . In Steps 31-35, 
power sums QJ" and the corresponding power allocations are calculated in the reverse order 
from the known UJ^'s. 

The complexity of the algorithm is 0{ML). 
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TABLE II 

Power Allocation Algorithm For Case III 



1: Initialize: = and Q^+i = 0, for all I, m; 

2: FOR Z = 1 TO L 

3: FOR m = TO M 

4: = r^(i + rrr.y"^w^{sn; 

5: END FOR 

6: Set n = {I,--- ,M} and * = 0; 

7: WHILE (fi ^ 0) 

8: m' = argmin^gn AJ"; 

9: Compute A' = ro(l + r„)=? W,(U^gro5r); 

10: IF((E^,^,„,Ar + A')<Ao) 

11: Add rn' to *; 

12: Ao = E^e* + ^'^ 

13: END IF 

14: Remove m' from f2; 

15: END WHILE 

16: IF (* = 0) 

17: W« = 5°; 

18: c? = c? + 1; 

19: Set W/" = 0, for all m ^ 0; 

20: ELSE 

21: Wf=U„,^>Sr; 

22: IF (1*1 < M) 

23: c? = c? + 1; 

24: END IF 

25: FOR me* 

26: cr = cr + 1; 

27: ur = sr\ 

28: END FOR 

29: END IF 

30: END FOR 

31: FOR / = L TO 1 

32: FOR m = TO M 

33: QT = F^{Qr+uUn and = QT - QT+u 

34: END FOR 

35: END FOR 
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C. Performance Evaluation 

We evaluate the performance of the proposed power allocation algorithms using MATLAB™. 
Three scenarios corresponding to the three cases in Section IIII-BI are simulated: (i) Case I: a 
single MBS; (ii) Case II: one MBS and one FBS; and (iii) Case III: one MBS and three FBS's. 

Since we do not find any similar schemes in the literature, we made the following comparisons. 
First, we compare Cases I and II with respect to BS power consumption and interference footprint. 
In both cases, there are = 8 users and L = A levels. In Case I, the MBS bandwidth h Bq = 2 
MHz. In Case II, the MBS and the FBS share the 2 MHz total bandwidth; the MBS bandwidth 
is i?o = 1 MHz and the FBS bandwidth is i?i = 1 MHz. The target data rate Rtar is set to 2 
Mbps. The channel gain from a base station to each user is exponentially distributed in each 
time slot. 

The interference footprints in the three dimensional space are plotted in Fig. [21 The height B 
of the cylinders indicates the spectrum used by a BS, while the radius r is proportional to the 
BS transmit power. In Case I when only the MBS is used, the total BS power is 45.71 dBm 
and the volume of the cylinder is nr'^B = 18, 841 MHz m^. In Case II when both the MBS and 
FBS are used, the total BS power is 34.58 dBm and the total volume of the two cylinders is 
2, 378 MHz m^. Using an additional FBS achieves a 11.13 dB power saving and the interference 
footprint is reduced to 12.62% of that in Case I. This simple comparison clearly demonstrate 
the advantages of femtocells achieved by bringing BS's closer to users. 

We next consider the more general Case III, using a femtocell network of one MBS and 
three FBS's. The MBS bandwidth is Bq = 1 MHz and each FBS is assigned with bandwidth 
Bm = I MHz, m = 1, 2, 3. The target data rate is still 2 Mbps. In Figs. |3] and S we plot 
four curves, each obtained with: (i) the heuristic scheme described in Section IIII-B3i (ii) The 
proposed algorithm presented in Section ITlI-B 4c i (iii) The upper bound; and (iv) the lower bound 
derived in Section IIII-B2I Each point in the figures is the average of 10 simulation runs. The 
95% confidence intervals are plotted as error bars, which are all negligible. 

In Fig. m we examine the impact of the number of packet levels L on the total BS transmit 
power. We increase L from 2 to 6, and plot the total power of base stations. As expected, the 
more packet levels, the larger the BS power consumption. Both the proposed and heuristic curves 
lie in between the upper and lower bound curves. When L is increased from 2 to 6, the power 
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Case I: MBS Case II: MBS and one FBS 




Fig. 2. Case I vs. Case II: interference footprints. 




Number of Levels (L) 



Fig. 3. Case III: impact of number of levels L. 



consumption of the heuristic scheme is increased by 12.22 dB, while the power consumption of 
the proposed algorithm is increased by 9.94 dB. The power savings achieved by the proposed 
algorithm over the heuristic scheme range from 3.92 dB to 6.45 dB. 

In Fig. m we show the impact of the BS bandwidths. The number of levels is L = 4. We fix 
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Fig. 4. Case III: impact of MBS bandwidtii Bq. 

the total bandwidth at 2 MHz, which is shared by the MBS and FBS's. We increase the MBS 
bandwidth from 0.4 MHz to 1.6 MHz in steps of 0.2 MHz, while decrease the bandwidth of 
FBS's from 1.6 MHz to 0.4 MHz. We find that the total power consumption is increased as 
Bq gets large. This is due to the fact that as the PBS bandwidth gets smaller, the FBS's have 
to spend more power to meet the minimum data rate requirement. The curve produced by the 
proposed algorithm has a smaller slop than that of the heuristic scheme: the overall increase 
in the total power of the proposed algorithm is 4.86 dB, while that of the heuristic scheme is 
20.84 dB. This implies that the proposed scheme is not very sensitive to the bandwidth allocation 
between the MBS and FBS's. The proposed algorithm also achieves consider power savings over 
the heuristic scheme. When Bq = 1.6 MHz, the total power of the proposed algorithm is 20.75 
dB lower than that of the heuristic scheme. 

IV. Video over CR Femtocell Networks 

In this section, we investigate the problem of video streaming in femtocell cognitive radio 
(CR) networks and formulate a stochastic programming problem to examine three deployment 
scenarios. In the case of a single FBS, we apply dual decomposition to develop an optimum- 
achieving distributed algorithm, which is shown also optimal for the case of multiple non- 
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interfering FBS's. In the case of multiple interfering FBS's, we develop a greedy algorithm that 
can compute near-optimal solutions, and prove a closed-form lower bound for its performance 
based on an interference graph model. The proposed algorithms are evaluated with simulations, 
and are shown to outperform three alternative schemes with considerable gains. 

A. System. Model and Preliminaries 

1) Spectrum and Network Model: We consider a spectrum consisting of (M + 1) channels, 
including one common, unlicensed channel (indexed as channel 0) and M licensed channels 
(indexed as channels 1 to M). The M licensed channels are allocated to a primary network, and 
the common channel is exclusively used by all CR users. We assume all the channels follow a 
synchronized time slot structure ll30l . The capacity of each licensed channel is Bi Mbps, while 
the capacity of the common channel is Bq Mbps. The channel states evolve independently, while 
the occupancy of each licensed channel follows a two-state discrete-time Markov process. 

The femtocell CR network is illustrated in Fig. [5l There is an MBS and FBS's deployed 
in the area to serve CR users. The N FBS's are connected to the MBS (and the Internet) via 
broadband wireline connections. Due to advances in antenna technology, it is possible to equip 
multiple antennas at the base stations. The MBS has one antenna that is always tuned to the 
common channel. Each FBS is equipped with multiple antennas (e.g., M) and is able to sense 
multiple licensed channels at the beginning of each time slot. There are Ki CR users in femtocell 
i, i = 1,2,- ■■ ,N, and J2i=i = ^- Each CR user has a software radio transceiver, which 
can be tuned to any of the M+l channels. A CR user will either connect to a nearby FBS using 
one or more of the licensed channels or to the MBS via the common channel. 

Although the CR users are mobile, we assume constant topology during a time slot. If the 
topology is changed during a time slot, the video transmission will only be interrupted for 
the time slot, since the proposed algorithms are executed in every time slot for new channel 
assignment and schedule. 

2) Spectrum Sensing and Access: The femtocell CR network is within the coverage of the 
infrastructure-based primary network. Both FBS's and CR users sense the channels to identify 
spectrum opportunities in each time slot. Each time slot consists of (i) a sensing phase, when CR 
users and FBS's sense licensed channels, (ii) a transmission phase, when CR users and FBS's 
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Fig. 5. A femtocell CR network with one MBS and four FBS's. 



attempt to access licensed channels, and (ill) an acknowledgment phase, when acknowledgments 
(ACK) are returned to the source. 

Cooperative sensing policy is also adopted here. We also adopt a hypothesis test to detect 
channel availability. We assume that each CR user chooses one channel to sense in each time 
slot, since it only has one transceiver. The sensing results will be shared among CR users and 
FBS's via the common channel in the sensing phase. Given L sensing results on channel m, the 
availability of channel m, i.e., P^(6™, ■ ■ ■ , 0^), can be computed iteratively as follows. 
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(29) 



We adopt a probabilistic approach: based on sensing results 6^, we have Dm{t) = with 
probability Pj^(Qm) and Dm{t) = 1 with probability 1 — P^(9m). For primary user protec- 
tion, the collision probability with primary users caused by CR users should be bounded. The 
probability P^(9m) is determined as follows 

P^iQm) = min \l - P^(e„)l , l| . (30) 
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Let A{t) := {m\Dmit) = 0} be the set of available channels in time slot t. Then G* = 
^mGyt(t) -^m(®]") expected number of available channels. These channels will be accessed 
in the transmission phase of time slot t. 

3) Channel Model: Without loss of generality, we consider independent block fading channels 
that is widely used in prior work Bill . The channel fading-gain process is piecewise constant 
on blocks of one time slot, and fading in different time slots are independent. Let /^"'(x) denote 
the probability density function of the received SINR X from a base station i at CR user j. We 
assume the packet can be successfully decoded if the received SINR exceeds a threshold H. 
The packet loss probability from base station i to CR user j is 

P,, = Pr{X <H}= r fi\x)dx = Fil\H), (31) 

Jo 

where Flf{H) is the cumulative density function of X. 

In the case of correlated fading channels, which can be modeled as finite state Markov 
Process [f32l . the packet loss probability in the next time slot can be estimated from the known 
state of the previous time slot and the transition probabilities. If the packet is successfully 
decoded, the CR user returns an ACK to the base station in the ACK phase. We assume ACKs 
are always successfully delivered. 

4) Video Performance Measure: We assume each active CR user receives a real-time video 
stream from either the MSB or an FSB. Without loss of generality, we adopt the MGS option 
of H.264/SVC, for scalability to accommodate the high variability of network bandwidth in CR 
networks. 

Due to real-time constraint, each Group of Pictures (GOP) of a video stream must be delivered 
in the next T time slots. With MGS, enhancement layer NAL units can be discarded from 
a quality scalable bit stream, and thus packet-based quality scalable coding is provided. Our 
approach is to encode the video according to the maximum rate the channels can support. During 
transmission, only part of the MGS video gets transmitted as allowed by the current available 
channel bandwidth. The video packets are transmitted in decreasing order of their significance 
in decoding. When a truncated MGS video is received and decoded, the PSNR is computed by 
substituting the effective rate of the received MGS video into (|32l) given below, thus the original 
video is not required. 

Without loss of generality, we assume that the last wireless hop is the bottleneck; video data 
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Fig. 6. Rate-distortion curves of three H.264/SVC MGS videos. 



is available at the MBS and FBS's when they are scheduled to be transmitted. The quality of 
reconstructed MGS video can be modeled as [Sj: 

W{R) = a + /3xR, (32) 

where W{R) is the average peak signal-to-noise ratio (PSNR) of the reconstructed video, R is 
the received data rate, a and (3 are constants depending on the video sequence and codec. 

We verified (|32l ) using an H.264/SVC codec and the Bus, Mobile, and Harbour test sequences. 
In Fig. [6l the markers are obtained by truncating the encoded video's enhancement layer at dif- 
ferent positions to obtain different effective rates, while the curves are computed using (|32l) . The 
curves fit well with measurements for the three sequences. It is worth noting that PSNR may not 
be a good measure of video quality as compared with alternative metrics such as MS-SSIM [33] . 
The main reason for choosing PSNR is that there is a closed-form model relating it to network 
level metrics-video rate. With the closed-form model, we can have a mathematical formulation of 
the scheduling/resource allocation problem, and derive effective algorithms. Should such closed- 
form models be available for MS-SSIM, it is possible to incorporate it into the optimization 
framework as well. 
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B. MGS Video over Femtocell CR Networks 

In this section, we address the problem of resource allocation for MGS videos over femtocell 
CR networks. We first examine the case of a single FBS, and then the more general case of 
multiple non-interfering or interfering FBS's. The algorithms for the single and non-interfering 
FBS cases are distributed ones and optimal. The algorithm for the interfering FBS case is a 
centralized one that can be executed at the MBS. To simplify notation, we omit the time slot 
index t for most of the variables in this Section. For example, x represents a variable for time 
slot t, represents the variable in time slot (t — 1), and x'^ represents the variable in time slot 

1) Case of Single FBS: 

a) Formulation: We first consider the case of a single FBS in the CR network, where the 
FBS can use all the G available channels to stream videos to K active CR users. Let Wj be 
the PSNR of CR user j at the beginning of time slot t and Wj the PSNR of CR user j at the 
end of time slot t. In time slot t, Wj is already known; Wj is a random variable that depends 
on channel condition and primary user activity; and Wj' is a realization of Wj. Let and ^ij 
indicate the random packet losses from the MBS and FBS, respectively, to CR user j in time 
slot t. That is, is 1 with probability Pi j = 1 — Pi j and with probability Pij. Due to block 
fading channels, Pi/s do not change within the time slot. 

Let pqj and pij be the portions of time slot t when CR user j receives video data from the 
MBS and FBS, respectively. The average PSNR is computed every T time slots. We first have 
Wj{0) = aj, when t = 0. In each time slot t, the CR user receives ^ojPoj-Bo bits through the 
MBS, and ^ijpijGBi bits through the FBS (assuming that OFDM is used), which contribute an 
increase of I3{^qjPqjBq+^ijPijGBi)/T to the total PSNR in this T time slot interval, according 
to (|32l) . Therefore we have the following recursive relationship: Wj = W~ + /3(^o,iPo,i-Bo + 
^^jPijGB^)/T = W- + UjPojRoj + ^ijPijGRi^,, where Roj = ^Bq/T and R,j = f3B,/T. 

For proportional fairness, we aim to maximize the sum of the logarithms of the PSNRs of all 
CR users [[34l . We formulate a multistage stochastic programming problem by maximizing the 
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expectation of the logarithm-sum at time T. 

K 

maximize: ^E[log(W,(T))] (33) 
subject to: Wj = W' + ^ojpojRoj + ^ijpijGRij, j = 1, ■ ■ ■ , K, t = 1, ■ ■ ■ ,T 

K 

^Pi,j < 1, i = 0, 1, t = 1, ■ ■ ■ ,T 

i=i 

pi,j > 0, i = 0,l, J = 1, ■ ■ ■ , ^, t = l,--- ,T. 

Roj = l3jBo/T and Rij = (5jBi/T are constants for the j-th MGS video. 

At the beginning of the last time slot T, a realization $,[t-i] = [^1,^2, ■ ■ ■ ,^r-i] is known, 
where = [-^0,1 > ^0,2, ■■■ , ^Ik, ■■■ , Ci,k]' t = 1, 2, • • ■ , T - 1. It can be shown that the 
multistage stochastic programming problem (|33l) can be decomposed into T serial sub-problems, 
each to be solved in a time slot t [|7]|. 

K 

maximize: ^E{log(W,)|^[i„i]} (34) 
subject to: Wj = W~ + ^ojpojRoj + ^ijpijGRijJ = 1,--- ,K 

K 

^Pi,j < 1, i = 0,l 

pi,j > 0, « = 0, 1, j = 1, - ■ ■ 

where E{log(M/'j)|^[t_i]} denotes the conditional expectation of log(iyj) given realization $,[t-i\- 
W~ is known given the realization. When t = 1, the conditional expectation becomes an 
unconditional expectation. 

Since a CR user has only one transceiver, it can operate on either one or more licensed 
channels (i.e., connecting to the FBS) or the common channel (i.e., connecting to the MBS), but 
not both simultaneously. Assume CR user j operates on the common channel with probability 
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Pj and one or more licensed channels with probability qj. We then rewrite problem (|34l) as 



Pj + 9j = 1> j = 1, ■ ■ ■ ,^ 

Pij, Pj, gi > 0, z = 0, 1, j = 1, ■ ■ ■ , i^'. 

i^j Properties: In this section, we analyze the formulated problem (l35l) and derive its 
properties. We have Lemmas 1, 2, and 3 and Theorem 1 and provide the proofs in the following. 

Lemma 1. Problem ( |35l) is a convex optimization problem. 

Proof: First, it can be shown that the single termp^Poj log(^j~+Po,j-Ro,j)+Q'i-Pi,j ^og{W~ + 
PijGRij) is a concave function, because its Hessian matrix is negative semi-definite. Then, the 
objective function is concave since the sum of concave functions is also concave. Finally, all the 
constraints are linear. We conclude that problem (|35l) is convex with a unique optimal solution. 

■ 

Lemma 2. If [p, p, g] is a feasible solution to problem 051) . then [p, q, p] is also feasible. 

Proof: Since [p,p,q] is feasible, we have p + q = 1. Switching the two probabilities, we 
still have q + p = 1. Therefore, the derived new solution is also feasible. ■ 

Lemma 3. Let the optimal solution be [p*,p*,q*]- If p* > q*, then pQ^jXogiWf^ + p^jRoj) is 
greater than or equal to Pi^j\og{W~ + pljGRij). And vice versa. 

Proof: Assume Pqj \og{Wf + p^jRoj) is less than Pij \og{Wf + pljGRij). Since p* > q*, 
the sum of the product p*Poj logiWf + p^jRoj) + qjPij log{W~ + pljGRij) is smaller than 
the sum of the product q*Poj\og{W~ + p^jRoj) + p*Pij\og{W~ + pl jGRij). Thus we can 
obtain an objective value larger than the optimum by switching the values of p* and q*, which 
is still feasible according to Lemma 2. This conflicts with the assumption that [p*,p*,q*] is 
optimal. The reverse statement can be proved similarly. ■ 



K 



maximize: ^[pjPoj\og{Wj + poj^oj) + QjPij^ogiWj + pijGRij)] 



K 



subject to: 
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Theorem 1. Let the optimal solution be [p*,p*, g*]. IfPj> q], then we have v] = ^ <^nd q* = 0. 
Otherwise, we have Pj=Q and q* = 1. 

Proof: If p* > q*, we have Pqj ^og{W~ + p^ jRoj) > Pij \og{W~+pl jGRij) according to 
Lemma 3. Since the objective function is linear with respect to pj and qj, the optimal value can 
be achieved by setting pj to its maximum value 1 and qj to its minimum value 0. The reverse 
statement can be proved similarly. ■ 
According to Theorem 1, a CR user is connected to either the MBS or the FBS for the entire 
duration of a time slot in the optimal solution. That is, it does not switch between base stations 
during a time slot under optimal scheduling. 

c) Distributed Solution Algorithm: To solve problem (|35l) . we define non-negative dual 
variables X = [Aq, Ai] for the two inequality constraints. The Lagrangian function is 

K 

C{p, p,X) = [P^P^'^ + /^o.^-Ro,,) + (1 - P,) A,, hg{Wr + p.jGR,,)] + 

K K 

Ao(l-^Po,i) + Ai(l-5^Pi,,) 

i=i i=i 

K 

Ao,Ai) + Ao + Ai, (35) 

where 

^APv Po,j^ Pi,j^ -^0, Ai) = PjPoj \og{W~ + po,jRo,j) + 
(1 - Pj)Pi,j \og{Wf + pijGRij) - AoPoj - Aipij. 

The corresponding problem can be decomposed into K sub-problems and solved iteratively. In 
Step T > 1, for given Ao(t) and Ai(r) values, each CR user j solves the following sub-problem 
using local information. 

[Pi(^)'^o,j(^)'Pi,i(^)] = argmax poj, pij, Ao(r), Ai(r)). (36) 

P],Po,j,Pi,j>0 



There is a unique optimal solution since the objective function in (1361) is concave. The CR users 
then exchange their solutions. The master dual problem, for given p(r) and p(r), is: 



K 

min£(p(r),p(r), A) = ^ £j(pj(r), poj(r), pij(r), Aq, Ai) + Aq + Ai. (37) 
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TABLE III 

Algorithm for the Case of Single FBS 



1: 

2: 

3: 
4: 

5: 

6: 

7: 

8: 

9: 

10: 

11: 



Set T = 0, Ao(0) and Ai(0) to some nonnegative value; 
DO % (each CR user j executes Steps 3-8) 



Pg.j 



' Pl,j(^) = 



Pi, 



IF [Po,j \og{W~ + po,Ar)Ro,j) - Ao(r)po,,(r) 
[A,, log(W^- + pi,,(r)GJ?i,j) - Xi{r)piAr)] 

Set pj(r) = 1 and Pij{t) = 0; 
ELSE 

Set Pj(r) = and po,j{T) — 0; 
END IF 

MBS updates Ai(r + 1) as in l [38l l: 

r = T + 1; 
WHILE (ELo(^4^ + 1) - ^«(^))' > 



Since the Lagrangian function is differentiable, the gradient iteration approach can be used. 

K 

Ai(r + 1)= Ai(r)-sx 




,^ = 0,1, (38) 

where s is a sufficiently small positive step size and [■]+ denotes the projection onto the 
nonnegative axis. The updated Aj(r + 1) will again be used to solve the sub-problems, and 
so forth. Since the problem is convex, we have strong duality; the duality gap between the 
primal and dual problems is zero. The dual variables A(r) will converge to the optimal values 
as r goes to infinity. Since the optimal solution to (|36l) is unique, the primal variables p(r) and 
pjj (r) will also converge to their optimal values when r is sufficiently large. 

The distributed solution procedure is presented in Table Unl In the table. Steps 3-8 solve the 
sub-problem in (l36l) : Step 9 updates the dual variables. The threshold is a prescribed small 
value with < ^ 1. The algorithm terminates when the dual variables are sufficiently close 
to the optimal values. 

2) Case of Multiple Non-interfering FBS's : We next consider the case of > 1 non- 
interfering FBS's. The coverages of the FBS's do not overlap with each other, as FBS 1 and 2 
in Fig. [51 Consequently, each FBS can use all the available licensed channels without interfering 
other FBS's. Assume each CR user knows the nearest FBS and is associate with it. Let W, denote 
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the set of CR users associated with FBS i. The resource allocation problem becomes: 

K N 

maximize: ^PjPoj ^og{W~ + pojRoj) + QjPij ^og{W~ + pijGRij) 

j=l i=l j&Ai 

K 

subject to: ^ po,j < 1 
i=i 

Pj + Qj = 1' J = 



(39) 



iV 
K 



Pi,j, Pj, qj>0, 



N, J 



K. 



Since all the available channels can be allocated to each FBS with spatial reuse, problem (|39l ) 
can be solved using the algorithm in Table Hill with some modified notation: Pij{t) now becomes 
Pjj(r) and Ai(r) becomes Aj(r), i = 1, ■ ■ ■ , iV. The dual variables are iteratively updated as 

Ao(r + 1) = Ao(r) - s x [l - $^pS,,(r) 1 (40) 



A.(r + 1) 



Ai(r) - s X 1 



jeUi 




.N. 



(41) 



The modified solution algorithm is presented in Table |IV1 As in the case of single FBS, the 
algorithm is jointly executed by the CR users and MBS, by iteratively updating the dual variables 
Ao(r) and Ai(r)'s, and the resource allocations Poji^) and p,*j(r)'s. It can be shown that the 
distributed algorithm can produce the optimal solution for problem (l39l) . 
3) Case of Multiple Interfering FBS's: 

a) Formulation: Finally, we consider the case of multiple interfering FBS's. Assume that 
the coverages of some FBS's overlap with each other, as FBS 3 and 4 in Fig. [51 They cannot 
use the same channel simultaneously, but have to compete for the available channels in the 
transmission phase. Define channel allocation variables Ci^rn for time slot t as: 

1, if channel m is allocated to FBS i 
0, otherwise. 



(42) 



Given an allocation, the expected number of available channels for FBS iisGi = X]meyi(t) '^i,mPm- 
We use interference graph to model the case of overlapping coverages, which is defined below. 
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TABLE IV 

Algorithm for the Case of Multiple Non-Interfering FB S ' s 

1: Set r = 0, and Ao(0) and Xi{0) to some nonnegative values, for 
all i; 

2: DO % (each CR user j executes Steps 3-8) 



3: 
4: 

5: 
6: 
7: 
8: 
9: 
10: 



WT 



>io(t) Ro.j 



\i(r) Ri.jG 



IF [Po,ilog(Vy-+Po,j(r)iio,,)-Ao(T)po,jM] > 
[B,, log(W^- + p.Ar)GR,,j) - A,(r)p,,,(r)] 

Set Pj(r) — 1 and Pi.jir) — 0; 
ELSE 

Set Pj (r) = and poj(t) = 0; 
END IF 

MBS updates Ai(r + 1) as in Hoj and (4ll : 

r = r + 1; 



11; WHILE 



{j:lo{Mr+i)-Mr)r><i>) 
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Fig. 7. Interference graph for the femtocell CR network shown in Fig. |5] 



Definition 1. An interference graph Gj = [Vj, Ej) is an undirected graph where each vertex 
represents an FBS and each edge indicates interference between the two end FBS's. 

For the example given in Fig. [51 we can derive an interference graph as shown in Fig. |71 FBS 
3 and 4 cannot use the same channel simultaneously, as summarized in the following lemma. 

Lemma 4. If channel m is allocated to FBS i, the neighboring vertices of FBS i in the 
interference graph Gi, denoted as 7l{i), cannot use the same channel m simultaneously. 



Further define index variables as 



1, if FBS i is an endpoint of link k E Gj 
0, otherwise. 



The interference constraint can be described as J2iLi d-iCi^m < 1, for m = 



(43) 

M, and for all 
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link k G Gj. We then have the following problem formulation. 

K N 

maximize: ^ pjPoj log{Wf + pojRoj) + Cfj^iJ ^og{W~ + pijGiRij) (44) 

j=l 1=1 j&Ai 

K 

subject to: ^ poj < 1 

j&Ai 

Pj + ?i = 1> J = 1, ■ ■ ■ ,^ 

G = \^ r 7 = 1... TV 

mGA(t) 

N 

(i^'cj^m < 1, m = 0, ■ ■ ■ , M, for link k e G/, 

i=l 

Pi,j,Pj, Qj, Ci^m > 0, i = 1, - ■ ■ ,N, j = 1, - ■ ■ ,K, m = 0, ■ ■ ■ , M. 

Z?) Solution Algorithm : The optimal solution to problem (l44l) depends on the channel 
allocation variables Q m. Problem (l44l) can be solved with the algorithm in Table ITVl if the q m's 
are known. Let Q{c) be the suboptimal objective value for a given channel allocation c, where 
c = [ci, C2, ■ ■ ■ , cjv] and q is a vector of elements q „i5 for FBS i and channels m E A{t). If 
all the FBS's are disjointedly distributed with no overlap, each FBS can use all the available 
channels. We have = 1 for all i and m E A{t), i.e., it is reduced to the case in Section ITV-B 2 [ 
To solve problem (|44|) . we first apply a greedy algorithm to allocate the available channels 
in A{t) to the FBS's (i.e., to determine c). We then apply the algorithm in Table |IV] with the 
computed c to obtain a near-optimal solution. Let „i be a matrix with 1 at position {i, m} and 
at all other positions, representing the allocation of channel m E A{t) to FBS i. The greedy 
channel allocation algorithm is given in Table |Vl where the FBS-channel pair that can achieve 
the largest increase in Q(-) is chosen in each iteration. The worst case complexity of the greedy 
algorithm is 0{N^M^). 

c) Performance Lower Bound: We next present a lower bound for the greedy algorithm. 
Let e(/) be the /-th FBS-channel pair chosen in the greedy algorithm, and tt; denote the sequence 
{e(l), e(2), ■ ■ ■ , e(/)}. The increase in object value (|44l) due to the /-th allocated FBS-channel 
pair is denoted as 

:= A(7rz, 7rz_i) = Q{tii) - Qi-Ki-i). (45) 
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TABLE V 

Channel Allocation Algorithm for Case of Interfering FB S ' s 



1: 


Initialize c to a zero matrix, FBS set M = {1, • ■ • , A'^}, and 




FBS-channel set C = A/" x A(t); 


2: 


WHILE (C is not empty) 


3: 


Find FBS-channel pair {i',m'}, such that 




{i' ,m'} = argmax{(5(c + e;,™) - Q{c)}\ 




{i,m}6C 


4: 


Set c c ~\~ Gj' fYi' , 


5: 


Remove {i',m'} from C; 


6: 


Remove 7?,(i') x m' from C; 


7: 


END WHILE 



Since Q{txq) = (5(0) = 0, we have 

L 

= QinL)-Q{7iL-i) + --- + Q{7ii)-Qino) 

1=1 

= g(7rL)-g(7ro) = g(vri). 

For two FBS-channel pairs e(/) and e(/'), we say e(/) conflicts with e(/') when there is an edge 
connecting the FBS in e(/) and the FBS in e(/') in the interference graph G[, and the two FBS's 
choose the same channel. Let ^7 be the global optimal solution. We define ui as the subset of 
that conflicts with allocation e(/) but not with the previous allocations {e(l), e(2), ■ ■ ■ , e(/ — 1)}. 

Lemma 5. Assume the greedy algorithm in Table^stops in L steps. The global optimal solution 
Q can be partitioned into L non-overlapping subsets ui, / = 1, 2, ■ ■ ■ ,L. 

Proof: According to the definition of ui, the L subsets of the optimal solution ^7 do not 
intersect with each other. Assume the statement is false, then the union of these L subsets is 
not equal to the optimal set ^l. Let the set difference be cj^+i = \ (ufL-^^cui). By definition, 
col+i does not conflict with the existing L allocations {e(l), ■ ■ ■ , e(L)}, meaning that the greedy 
algorithm can continue to at least the (L + l)-th step. This conflicts with the assumption that 
the greedy algorithm stops in L steps. It follows that = U^^w;. ■ 
Let A(7r2, tti) = Q{7r2) — Q{ni) denote the difference between two feasible allocations tti and 
Ti2- We next derive a lower bound on the performance of the greedy algorithm. We assume two 
properties for function A(7r2,7ri) in the following. 
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Property 1. Consider FBS-channel pair sets vti, 7^2, and a, satisfying tti C and a fl = 0. 
We have A(7r2 U a, tti U a) < A(7r2, tti). 

Property 2. Consider FBS-channel pair sets vr, ai, and (J2 satisfying (Ji fl vr = 0, 0-2 H vr = 0, 

anJ o"i n (72 = 0. Wfe /zflve A(cri U a2 U vr, vr) < A(cri U vr, vr) + A(cr2 U vr, vr). 

In Property 1, we have cr fl tti = since tti C 712 and cr fl 7r2 = 0. This property states that 
the incremental objective value does not get larger as more channels are allocated and as the 
objective value gets larger. Property 2 states that the incremental objective value achieved by 
allocating multiple FBS-channel pair sets does not exceed the sum of the incremental objective 
values achieved by allocating each individual FBS-channel pair set. These are generally true for 
many resource allocation problems B4L 

Since we choose the maximum incremental allocation in each step of the greedy algorithm, 
we have Lemma |6] that directly follows Step 3 in Table IVl 

Lemma 6. For any FBS-channel pair o G w/, we have Q(t^i-\ U o") — Q{t^i-\) = A{7ii^i U 
o-,7r/_i) < A,. 

Lemma 7. Assume the greedy algorithm stops in L steps, we have 

L 

Q{Q) < Qi-KL) + ^ ^ A((T U 7ri_i, 7r,_i). 

1=1 a^uJi 

Proof: The following inequalities hold true according to the properties of the A(-, ■) function: 

g((ui,+,a;,) U vr;) = Qi{utl+2^^) U tt,) + A((ui,+ia;,,) U tt,, iuti+2^i) U tt,) 

< Q{{uti+2^i) U TTi) + A{uji+i U ni, 7ii) 

< Qi{Utl+2^i) U TTi+i) + A^+i U 

< Q((ui;+2^^i) U TT^+i) + A(aU7rz,7rz). 

We have ttq = and ul+i = (see Lemma [5]). With induction from / = to / = L — 1, we 
have Q((uiia;,) U 0) = Q{9) and Q{Q) < Q{TiL) + Ef=i E.g., ^(^ U vr^^i, vr^^i). ■ 

Lemma 8. The maximum size of ui is equal to the degree, in the interference graph Gj, of the 
FBS selected in the l-th step of the greedy algorithm, which is denoted as D{1). 
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Proof: Once FBS i is allocated with channel m, the neighboring FBS's in Gi, lZ{i), cannot 
use the same channel m anymore due to the interference constraint. The maximum number of 
FBS-channel pairs that conflict with the selected FBS-channel pair {i,m}, i.e., the maximum 
size of uju is equal to the degree of FBS z in G/. ■ 
Then we have Theorem [2] that provides a lower bound on the objective value achieved by the 
greedy algorithm given in Table IVl 

Theorem 2. The greedy algorithm can achieve an objective value that is at least t— 7^ — of the 
global optimum, where Dmax is the maximum node degree in the interference graph Gj of the 
femtocell CR network. 

Proof: According to Lemmas |7] and [8l we have: 

L L 

1=1 1=1 
= {l + D)Qi7iL), (46) 



where D = J2i=i J2i=i ^i- The second equality is due to the facts that J2i=i^i — 

To further simplify the bound, we replace D(l) with the maximum node degree D^ax- We 
then have D < Yi=i Dmax^il Ef=i = Dmax and 

— -^g(fi) < Q{7,l) < Qm, (47) 

J- ~r -L'max 

which provides a lower bound on the performance of the greedy algorithm. ■ 
When there is a single FBS in the CR network, we have Dmax = and Q{t^l) = Q{^) 
according to Theorem |2l The proposed algorithm produces the optimal solution. In the case of 
multiple non-interfering FBS's, we still have D^ax = and can obtain the optimal solution using 
the proposed algorithm. For the femtocell CR network given in Fig. |5] (with interference graph 
shown in Fig. |7]), we have Dmax = 1 and the low bound is a half of the global optimal. Note 
that (|46l) provides a tighter bound for the optimum than (|47l) . but with higher complexity. These 
are interesting performance bounds since they bound the achievable video quality, an application 
layer performance measure, rather than lower layer metrics (e.g., bandwidth or time share). 
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C. Simulation Results 

We evaluate the performance of the proposed algorithms using MATLAB and JSVM 9.13 
Video codec. Two scenarios are used in the simulations: a single FBS CR network and a CR 
network with interfering FBS's. In every simulation, we compare the proposed algorithms with 
the following three more straightforward heuristic schemes: 

• Heuristic 1 based on equal allocation: each CR user chooses the better channel (i.e., the 
common channel or a licensed channel) based on the channel conditions; time slots are 
equally allocated among active CR users; 

• Heuristic 2 exploiting multiuser diversity: the MBS and each FBS chooses one active CR 
user with the best channel condition; the entire time slot is allocated to the selected CR 
user. 

• SCA-MAC proposed in [35 J: with this scheme, the successful transmission rate is evaluated 
based on channel packet loss rate and collision probability with primary users; the channel- 
user pair with the highest transmission probability is selected. 

We choose SCA-MAC because it adopts similar models and assumptions as in this paper. Once 
the channels are selected, the same distributed algorithm is used for scheduling video data for 
all the three schemes. 

We adopt the Raleigh block fading model and the packet loss probability is between [0.004, 
0.028]. The frame rate is set to 30 fps and the GoP size is 16. The base layer mode is set to 
be AVC compatible. The motion search mode is set to Fast Search with search range 32. Each 
point in the figures presented in this section is the average of 10 simulation runs with different 
random seeds. We plot 95% confidence intervals in the figures, which are generally negligible. 

1 ) Case of Single FBS: In the first scenario, there are M = 8 channels and the channel 
parameters and P{q are set to 0.4 and 0.3, respectively, for all m. The maximum allowable 
collision probability 7m is set to 0.2 for all m. There is one FBS and three active CR users. 
Three Common Intermediate Format (CIF, 352x288) video sequences are streamed to the CR 
users, i.e.. Bus to CR user 1, Mobile to CR user 2, and Harbor to CR user 3. We have T = 10 
as the delivery deadline. Both probabilities of false alarm e and miss detection 6 are set to 0.3 
for all the FBS's and CR users, unless otherwise specified. 

First we investigate the convergence of the distributed algorithm. The traces of the two dual 
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Fig. 8. Convergence of the two dual variables in the single FBS case. 



variables are plotted in Fig. [8l To improve the convergence speed, the correlation in adjacent 
time slots can be exploited. In particular, we set the optimal values for the optimization variables 
in the previous time slot as the initialization values for the variables in the current time slot. 
By doing so, the convergence speed can be improved. It can be seen that both dual variables 
converge to their optimal values after 300 iterations. After convergence, the optimal solution for 
the primary problem can be obtained. 

Our proposed scheme achieves the best performance among the three algorithms, with up to 
4.3 dB improvement over the two heuristic schemes and up to 2.5 dB over SCA-MAC. Such 
gains are significant with regard to video quality, since a 0.5 dB difference is distinguishable by 
human eyes. Compared to the two heuristic schemes and SCA-MAC, the video quality of our 
proposed scheme is well balanced among the three users, indicating better fairness performance. 

In Fig. m we examine the impact of the number of channels M on received video quality. 
First, we validate the video quality measure used in our formulation by comparing the PSNR 
value computed using (|32|) with that computed from real decoded video frames. The average 
PSNR for three received videos are plotted in the figure. It can be seen that the real PSNRs 
are very close to those predicted by (l32l) . with overlapping confidence intervals. This is also 
consistent with the results shown in Fig. [6l Second, as expected, the more licensed channels, the 
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Fig. 9. Single FBS: received video quality vs. number of channels (computed with (9) and measured by PSNR). 



more spectrum opportunities for CR users and the higher PSNR for received videos. SCA-MAC 
performs better than two heuristics, but is inferior to the proposed scheme. 

We also plot the MS-SSIM of the received videos at the three CR users in Fig. [10] [33] . 
Similar observations can be made from the MS-SSIM plot. All MS-SSIMs for the four curves 
are more than 0.97 and very close to 1. The proposed scheme still outperforms the other three 
schemes. In the remaining figures, we will use model predicted PSNR values, since the model 
([32]) is sufficient to predict the real video quality. 

In Fig. [m we demonstrate the impact of channel utilization r] on received video quality. 
The average PSNRs achieved by the four schemes are plotted when i] is increased from 0.3 to 
0.7. Intuitively, a smaller r/ allows more spectrum opportunities for video transmission. This is 
illustrated in the figure where all the three curves decrease as i] gets larger. The performance of 
both heuristics are close and the proposed scheme achieves a gain about 3 dB over the heuristics 
and 2 dB over SCA-MAC. 

We also compare the MGS and FGS videos while keeping other parameters identical. We find 
that MGS video achieves over 0.5 dB gain in video quality over FGS video. The results are 
omitted for brevity. 
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Fig. 10. Single FBS: received video quality vs. number of channels (measured by MS-SSIM). 




Fig. 11. Single FBS: received video quality vs. channel utilization. 



2) Case of Interfering FBS's: We next investigate the second scenario with three FBS's, and 
each FBS has three active CR users. Each FBS streams three different videos to the corresponding 
CR users. The coverages of FBS 1 and 2 overlap with each other, and the coverages of FBS 2 
and 3 overlap with each other. 
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Fig. 12. Interfering FBS's: received video quality vs. number of channels. 



In Fig. [121 we examine the impact of the number of channels M on the received video quality. 
The average PSNRs of all the active CR users are plotted in the figure when we increase M 
from 12 to 20 with step size 2. As mentioned before, more channels imply more transmission 
opportunities for video transmission. In this scenario, heuristic 2 (with a multiuser diversity 
approach) outperforms heuristic 1 (with an equal allocation approach). But its PSNRs are still 
about 0.3 ~ 0.5 dB lower that those of the proposed algorithm. The proposed scheme has up to 
0.4 dB improvement over SCA-MAC. In Fig. [121 we also plot an upper bound on the optimal 
objective value, which is obtained as in (|46l) . It can be seen that the performance of our proposed 
scheme is close to optimal solution since the gap between the upper bound and our scheme is 
generally small (about 0.5 dB). 

Next, we examine the impact of sensing errors on the received video quality. In Fig. [131 we test 
five pairs of {e, 5} values: {0.2,0.48}, {0.24,0.38}, {0.3,0.3}, {0.38,0.24}, and {0.48,0.2}. It is 
interesting to see that the performance of all the four schemes get worse when the probability of 
one of the two sensing errors gets large. We can trade-off between false alarm and miss detection 
probabilities to find the optimal operating point for the spectrum sensors. Moreover, the dynamic 
range of video quality is not big for the range of sensing errors simulated, compared to that in 
Fig. [121 This is because both sensing errors are modeled and treated in the algorithms. Again, 
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Fig. 13. Interfering FBS's: received video quality vs. sensing error probability. 

our proposed scheme outperforms the two heuristic schemes and SCA-MAC with considerable 
margins for the entire range. 

We also investigate the impact of the bandwidth of the common channel Bq. In this simulation, 
we fix Bi at 0.3 Mbps and increase Bq from 0.1 Mbps to 0.5 Mbps with step size 0.1 Mbps. The 
results are presented in Fig. [Ml We notice that the average video quality increases rapidly as 
the common channel bandwidth is increased from 0.1 Mbps to 0.3 Mbps. Beyond 0.3 Mbps, the 
increases of the PSNR curves slow down and the curves get flat. This implies that a very large 
bandwidth for the common channel is not necessary, since the gain for additional bandwidth 
diminishes as Bq gets large. Again, the proposed scheme outperforms the other three schemes 
and the gap between our scheme and the upper bound is small. 

Next, we stop the distributed algorithm after a fixed amount of time, and evaluate the subopti- 
mal solutions. In particular, we vary the duration of time slots, and let the distributed algorithm 
run for 5% of the time slot duration at the beginning of the time slot. Then the solution obtained 
this way will be used for the video data transmissions. The results are presented in Fig. \T5\ It 
can be seen that when the time slot is 5 ms, the algorithm does not converge after 5%x5 = 
0.25 ms and the PSNR produced by the distributed algorithm is close to that of Heuristic 1, 
and lower than those of Heuristic 2 and SCA-MAC. When the time slot is sufficiently large. 
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Fig. 14. Interfering FBS's: received video quality vs. bandwidth of the common channel. 
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Fig. 15. Video quality achieved by the algorithms when they are only executed for 5% of the time slot duration. 



the algorithm can get closer to the optimal and the proposed algorithm produces better video 
quality as compared to the two heuristic algorithms and SCA-MAC. Beyond 20 ms, the increase 
in PSNR is small since all the curves gets flat. Therefore the proposed algorithm could be useful 
even when there is no time for it to fully converge to the optimal. 
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During the simulations, we find the coUision rate with primary users are strictly kept below 
the prescribed collision tolerance 7. These results are omitted for brevity. 

V. Conclusions 

In this paper, we first investigated data multicast in femtocell networks consisting of an 
MBS and multiple FBS's. We adopted SC and SIC for multicast data and investigated how 
to assign transmit powers for the packet levels. The objective was to minimize the total BS 
power consumption, while guaranteeing successful decoding of the multicast data at each user. 
We developed optimal and near-optimal algorithms with low computational complexity, as well 
as performance bounds. The algorithms were evaluated with simulations and are shown to 
outperform a heuristic with considerable gains. 

Next, we investigated the problem of streaming multiple MGS videos in a femtocell CR 
network. We formulated a multistage stochastic programming problem considering various design 
factors across multiple layers. We developed a distributed algorithm that can produce optimal 
solutions in the case of non-interfering FBS's, and a greedy algorithm for near-optimal solutions 
in the case of interfering FBS's with a proved lower bound. The proposed algorithms are evaluated 
with simulations and are shown to outperform three alternative schemes with considerable gains. 
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